Waiting Cycle Times and Generalized Haldane 
Equality in the Steady-state Cycle Kinetics of 

Single Enzymes 

Hao Ge* 
April 15, 2009 

Abstract 

Enzyme kinetics are cyclic. A more realistic reversible three-step mechanism of 
the Michaelis-Menten kinetics is investigated in detail, and three kinds of waiting 
cycle times T, r+, are defined. It is shown that the mean waiting cycle times 
(T), (r+), and (r_) are the reciprocal of the steady-state cycle flux J^*, the forward 
steady-state cycle flux and the backward steady-state cycle flux respectively. 
We also show that the distribution of r+ conditioned on T+ < r_ is identical to the 
distribution of T_ conditioned on r_ < T_|_, which is referred as generalized Haldane 
equality. Consequently, the mean waiting cycle time of T+ conditioned on r_|_ < T_ 
((T+|T+ < r_}) and the one of T_ conditioned on r_ < r+ |T_ < T+)) arc both 
just the same as (T). In addition, the forward and backward stepping probabilities 
are also deflned and discussed, especially their relationship with the cycle fluxes 
and waiting cycle times. Furthermore, we extend the same results to the n-step cycle, 
and finally, experimental and theoretically based evidences are also included. 

KEY WORDS: waiting cycle times; generalized Haldane equality; single-molecule 
experiment; nonequilibrium steady states; cycle flux; stepping probability 

1 Introduction 

Living cells function thermodynamically as open systems that are far from static thermal 
equilibrium, since cells must continually extract energy from their surroundings in order 
to sustain the characteristic features of life such as growth, cell division, intercellular 
communication, movement and responsiveness to their environment. 
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Prom the view of statistical physics, these stochastic models for systems biology exhibit 
nonequilibrium steady states (NESS) in which nonequilibrium circulations (cycle fluxes) 
necessarily emerge |36j . Hong Qian and his co-workers have recently discussed the relation 
between an NESS and traditional nonlinearly dynamics [38] , B2l [371 [39] . 

The researches on irreversible systems far from equilibrium began with the works by 
Haken [16l [T7| about laser and Prigogine, etc. [151 El] about oscillations of chemical 
reactions. It is closely related to another concept of macroscopic irreversibility in nonequi- 
librium statistical physics. A macroscopic irreversible system in a steady state should have 
positive entropy production rate and should be in nonequilibrium. 

T.L. Hill, etc. [181 HI EOl EI] constructed a general mesoscopic model for the combi- 
nation and transformation of biochemical polymers in vivid metabolic systems since 1966. 
Their results can be applied to explain the mechanism of muscle contraction and active 
transports [TO] . 

Mathematical theory of nonequilibrium steady states and circulation (cycle fluxes) 
has been discussed for several decades since the original work |43[ HSj H6] . in which 
Qian and co-workers developed the formulae for entropy production rate and circulation 
distribution of homogeneous Markov chains, Q-processes and diffusions, and moreover 
their relationship with reversibility. They concluded that the chain or process is reversible 
if and only if its entropy production vanishes, or iff there is no net cycle fluxes. Here, we 
recommend a recent book [26] for the systematic presentation of this theory. 

Recently, we investigate the synchronized stochastic dynamics of a network model of 
yeast cell-cycle regulation [13], applying the mathematical theory of cycle fluxes (circu- 
lation) of Markov chains. In our model of yeast cell cycle, the trajectory concentrates 
around a main cycle with the dominant circulation, which we call stochastic limit cycle, 
that is the natural generalization of the deterministic limit cycle in the stochastic system. 

Recent advances in single-molecule spectroscopy and manipulation have now made 
it possible to study enzyme kinetics at the level of single molecules, where the stochas- 
tic effects, termed as "dynamic disorder", are significant. Experimentalists can not only 
directly measure the distributions of molecular properties through single-molecule experi- 
ments rather than the ensemble average, but also apply the theory of stochastic processes 
to analyze the statistical properties of the stochastic trajectory [29 1 1 ^ [50 l [5T]. 

Xie, et.al [311 [321 [9l [28] observed that the mean waiting time is the same as the 
reciprocal of the Michaelis-Menten steady-state flux (i.e., the cycle flux in my language). 
But the model they built in their theoretical analysis is the simplest irreversible Michaelis- 
Menten mechanism, and the state space of their stochastic model (Markov chain) actually 
only contains two states {E and ES) , which does not distinguish the two different pathways 
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from ES to E and is always in mathematical detailed balance rather than chemical detailed 
balance. That is just why they can only directly using the ordinary differential equations 
to get the explicit distribution function f(t) of the waiting time, and avoid applying the 
strong Markov property, which is the basic method to compute mean waiting times in 
stochastic processes. So their method can not be generalized to more complicated cases, 
and generally speaking, the explicit distribution function f(t) can rarely be obtained in 
such an analytic form. 

In the present paper, a more realistic reversible three-step mechanism of the Michaelis- 
Menten kinetics is investigated in detail, and three kinds of waiting cycle times T, T+, r_ 
are defined. It is shown that the mean waiting cycle times (T), (T+), and (T_) are the 
reciprocal of the steady-state cycle flux J^**, the forward steady-state cycle flux and 
the backward steady-state cycle flux Jf.** respectively. 

We also show that the distribution of T-^ conditioned on < T_ is identical to the 
distribution of T_. conditioned on T_ < T^, which is referred as generalized Haldane 
equality [H]. This is a key result of this work. There is experimental evidence for it, as 
well as theoretical models proving equal mean time [Sj \27\ |28] . 

Consequently, the mean waiting cycle time of T+ conditioned on T+ < T_ ((T+IT^ < 
r_)) and the one of T_ conditioned on T_ < T-^ {{T- |T_ < T-^ )) are both just the same as 
(T). In addition, the forward and backward stepping probabilities p~^,p~ are also defined 
and discussed, especially their relationship with the cycle fluxes and waiting cycle times. 
Furthermore, we extend the same results to the n-step cycle, and finally, experimental and 
theoretically based evidences are also included. 



2 Single enzyme kinetics: cycle flux and NESS 

This subsection is just a brief introduction to our model and cycle fluxes, which is from 
Ref. [36]. 

We consider a more realistic three-step mechanism of the Michaelis-Menten kinetics in 
which the conversion of S into P in the catalytic site of the enzyme is represented as a 
process separate from release of P from the enzyme (Fig. H^a)): 

E + S ^ ES ^ EP A E + P. (1) 

k-i A;_2 fcOg 

If there is only one enzyme molecule, then from the enzyme perspective, the kinetics 
are stochastic and cyclic, as shown in Fig. [I](b), with the pseudo-first-order rate constants 
ki = k^cs and fc_3 = k^^cp where cs and cp are the sustained concentrations of substrate 
5 and P in the steady state. 
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Figure 1: Kinetic scheme of a simple reversible enzyme reaction (a) in which and 
are second-order rate constants. From the perspective of a single enzyme molecule, the 
reaction is unimolecular and cyclic (b). The pseudo-first-order rate constants ki = k^cs 
and /c_3 = k^^cp where cs and cp are the concentrations of substrate S and P in the 
steady state. 

At the chemical equilibrium, the concentrations of S and P satisfy ^ = ]~^~^u~^ 

i.e. 

kikok'i , , 

1. (2) 



k^ik-2k-3 

This is the "thermodynamic box" in elementary chemistry, also known as Wegscheider's 
relation and detailed balance. However, if the cs and cp are maintained at constant levels 
that are not at chemical equilibrium, as metabolite concentrations are in living cells, the 
the enzyme reaction is in an open system that approaches a NESS. This is the scenario in 
enzyme kinetics. 

In this case, 

kik2ks 

7/1, (3) 



k-ik-2k-3 

and = kpT In j is well known as the cellular phosphorylation potential. 

Prom the perspective of single enzyme molecule, the rate equation for the probabilities 
of the states is a master equation 
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^^1^ = -(ki + k-3)PE{t) + k.iPEsit) + hPEp{t) 

dPEsit) 



dt 

dPEp{t) 



kiPEit) - + k2)PEsit) + k^2PEp{t) 

k-3PE{t) + k2PEs{t) - {k^2 + k3)PEp{t) (4) 
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The steady-state probabilities for states ES and iJJP are easy to compute by setting 
the time derivative to zero and noting that Pe + Pes + Pe p = 1 for the total probability. 



^e 



ES 



^ EP 

tiii%2 -|- 1^21^3 "I" '^3'VI "I" '«-lft'-3 "I" '*'-2'*;-3 "I" i^~li^-2 T i^li^-2 ^ '^2'^-3 "I" '^3'^-" 

(5) 

Then, the clockwise steady-state cycle flux in Fig. [Hb), which is precisely the enzyme 
turnover rate of 5 ^ P in Fig. HJa), J'' = Pf fci - PUsk-i = PI5/C2 - PEpk-2 = 
Pep^3 ~ PEk-3, which follows 



jss ^ kik2k3 - k_ik_2k-3 ^ J^'^-J'"' 

kik2 + k2k3 + k3ki + k^ik-3 + k^2k-3 + k-ik^2 + kik^2 + k2k-.3 + k3k^i + 

(6) 

where 

J: 



ss _ kik2k3 



kik2 + ^2^3 + kski + k-ik^3 + k-2k-3 + k-ik-2 + kik-2 + k2k-3 + k3k_i 
is the forward cycle flux, and 

^ k-ik-2k-3 

kik2 + k2k3 + kski + k-ik-3 + k-2k-3 + k-ik-2 + kik-2 + A;2^-3 + ^^3^-1 ' 

is the backward cycle flux. 

The net cycle flux is just the Michaelis-Menten steady-state flux of ([T|), i.e. 

_ kscs - kpcp 

wVioT-o h — k\k2kz , _ fc-ifc-2fc°3 _ fc-lfc-2+fc-lfc3+fc2A:3 i 

wneie fi,5 - fc_-^fc_2+A:_iA:3+fc2fc3 ' ~ fc_ifc_2+fc-iA:3+fc2fc3' '^'S' ~ fcO(A;_2+fe2+fc3) ' 
= ^.llt'+l^og^ - That is just Eq. (2.46) in 
In addition, and Jl'' can be rigorously proved to be the averaged numbers of the 
forward and backward cycles per time respectively due to ergodic theory [26^ Theorem 
2.1.2], i.e. 
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J- = limii.(t), 

t^oo t 

Jf = lim -u+(t), 

t~^oo t 

rj = Ihnju^it), (7) 

where iy+{t) and z^-(t) are the number of occurrences of forward and backward cycles up 
to time t, and z^(t) = i^+{t) — i^-{t). 

At the end of this section, it is important to notice that the quantity 7 can be ap- 
proximated by in single-molecule experiment when the time t is large enough, due 

jss 

to the fact that 7 = -ps- and = J^^ (i.e. 7 = 1) if and only if this system is at chemical 
equilibrium. 



3 Waiting cycle times and generalized Haldane equality 

3.1 Mean waiting cycle times 

The most obvious feature of the turnover trajectory (Fig. IB in is its stochastic 
nature, exhibited both in the time needed for a chemical reaction which takes place on 
the subpicosecond time scale and that needed for diffusion and thermal activation which 
is much more longer. In the single- molecule experiment [29) . the emission on-time and off- 
time recorded correspond to the "waiting time" for the turnover reactions, respectively. 
Once holding the statistical data of the trajectory in hand, the most straightforward 
analysis of the trajectories is certainly the distribution of the on-and-off times, so in our 
theoretical model, waiting cycle times should be defined and their mean should also be 
calculated at the first step. 

Starting from the free enzyme state E, three kinds of waiting cycle times can be defined. 
T represents the waiting time for the occurrence of a forward or a backward cycle, 
represents the waiting time for the occurrence of a forward cycle, and T_ represents the 
waiting time for the occurrence of a backward cycle respectively. Obviously, T is just the 
smaller one of and T_ . 

The problem of computing the mean waiting time (T) can be transferred into an 
important application of first-passage-time(FPT) methods (Figl2]) to the cyclic chemical 
transformations, in particular single-enzyme kinetics (FiglTJb)). 

FPT problems are been well studied, and there are analytical results for first-passage 
times in a discrete-time one-dimensional asymmetric random walk for quenched disorder 
|35j . But actually what we investigated in the present paper is the continuous-time case 
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Figure 2: The kinetic scheme for computing the waiting cycle times. In order to distinguish 
the forward and backward cycles, Fig. [T]is transferred into a one-dimensional random walk 
model. 

rather than the discrete-time case in |35j, and the master equation ([8|) below is different 
from \35\ Eq.(2.6)]. Therefore, the expression of (T) cannot be regarded as a particular 
case of the more general FPT problem when M = L = 2 in [35^ Eq.(2.11)] although they 
are quite similar. 

Let Tj be the mean time first hitting the state 3 or —3 in FiglJl starting from the state 
i. Obviously, (T) = tq and = r„3 = 0. 

Applying the strong Markov property of continuous-time Markov chains [Ij, {r,} sat- 
isfies the following equations 



(T) 



T-2 = T — TT + J. — TT ^ + Z TTT^-i' 

1 /c_2 ^3 

^"""^ k-2 + k3 fc_2 + ^3 ^ k-2 + ks °' 

1 ki 
k-3 + ki fc_3 + ki '^^'^ k-3 4- fci 
1 k_i k2 

'^^ k-i+k2 k^i + k2 ° fc_i + /c2''"^' 

1 ko k^ , , 

fc_2 + K_2 + k-2 + /C3 

Through simple calculation, one can get that 

_ k\k2 + ^2^3 + k3k\ + k-ik-3 + k-2k-3 + k-ik^2 + ^1^-2 + k2k-3 + fca/c-i _ 1 
~ kik2k3 + k^ik^2k-3 ~ JFTj! 
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Similarly, another mean waiting cycle time (?+), which is the mean time to complete 
the forward cycle in Fig[l]||b) whether before or after reaching an "analogous" final state in 
the opposite direction, can also be obtained as solutions of the nearly identical equations 
to dH) but with modified boundary conditions. Let rj_|_ be the mean time first hitting the 
state 3, whether before or after the time hitting the state —3 in FiglJl starting from the 
state i. Obviously, (T+) = ro+, r3_|_ = and t„3+ = to+. 

Applying the strong Markov property of Markov chains again, {Tj_|_} satisfies the fol- 
lowing equations 



/c_i + fe2 + k2 k_i + fe2 

1 k_2 ^3 

'T-1+ = 1 —, — I- 1 —r'^-'i+ + 1 r~r'^^+^ 
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k^ 


-2 + ^3 




-2 + kj, 




1 


+ 




k-3 


k^ 


-3 + ^1 




-3 + ki 




1 


+ 




k-i 


k^ 


^i + k2 


k. 


^i + k2 




1 


+ 




k-2 



k_ 


-i + k2 




ks 


k^ 


-2 + ^3 




ki 



0+ ), , 7. "I" 7. i 7. 1+ ^ + kl 

k2 

' fe_i + k2 

T2+ = , 1 , + , , ^1+ + , ^1 . X 0> (9) 

k-2 + k^ k-2 + K3 k-2 + kz 

which gives that 

_ kik2 + /c2^3 + ^3^1 + k-ik-i + k-2k-3 + k-ik-2 + A;iA;_2 + ^2A;-3 + ksk-i _ 1 
^ +^ ~ hhh ~ J^' 

Almost the same derivations can be achieved for (T_), which is the mean time to 
complete the backward cycle in FiglUb), whether before or after reaching an "analogous" 
final state in the opposite direction, immediately follows 

_ kik2 + k2k3 + kski + k-ik-s + k-2k-3 + k-ik-2 + kik-2 + k2k-3 + ksk-i _ 1 

^ ~ k-ik-2k-3 ~ J¥' 

Surely, the expression of {T-) can be directly derived due to the symmetry of the 
random walk in Fig. [2j 

The quantitative relationship between the mean waiting cycle times ((T), (T^), and 
(r_)) and the cycle fiuxes (J***, J^, and J^) in this subsection is the first chief result of 
the present paper. 

Consequently, = (T_) if and only if this system is at chemical equilibrium, because 
of 7 = l^y- Therefore, 7 can also be measured by the ratio of averaged forward and 
backward waiting cycle times up to time t in the single- molecule experiment, which is 
different from the measure method introduced at the end of the previous subsection. 
Nonetheless, applying the elementary renewal theorem [HI Sec. 3. 4, Theorem 4.1,4.2], the 
two methods are asymptotically the same because (T+) ^ ^^'^ (^-) ~ TTJt) ^^^n t 
is large. 
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3.2 Stepping probability 

The stepping probabilities p'^{t) and p^{t) up to time t are just the fractions of and 
representing the weights of the forward and backward cycles respectively from the 
statistical point of view in experiments, i.e. 



According to EqlTl one can get the eventual stepping probability 

+ def +, . J+ kik2k3 

p = lim p \t) 



t^oo' ^ ' Jf + J!f kik2k3 + k-ik-2k-3 ' 

p- hm = - ^-^^-^^-3 no) 

^ /-I™ ^ J- + J- - /feifeafca + k^ik^2k^3 ■ ^ ^ 

It is necessary to point out that the stepping probabilities p'^{t) and p^{t) are random 
variables depending on the trajectories, while their fluctuations tend to vanish when t 
tends to infinity. Hence the eventual stepping probability p'^ and p~ are independent with 
the trajectories due to the ergodic theory. 

Interesting, the forward stepping probability can also be defined as p^ P|^|(T_|_ < 
r_), which means the probability that the particle first completes a forward cycle before 
a backward one starting from the initial free enzyme E. Similarly, the backward stepping 

_ def 

probability can be defined as p~ = P^e}{T- < This is the second chief result of the 
present article. 

This equivalence can be explicitly seen through translating this problem to a corre- 
sponding one of the random walk in Fig. O either. 

Let pi+ be the probability of hitting the state 3 before —3 in FiglJl starting from the 
state i. Obviously, p3+ = 1 and p-3+ = 0. 

Again applying the strong Markov property of Markov chains as what we have done 
in the precious section, {pi+} satisfies the following equations 



k-i k2 

P-2+ = 1 — X + —P-1+, 

fe_i + fc2 

ks 

K-2 + ks 

= ^—^P-'^^kl^ThP'^^ 

ks 

P2+ = -1-PI+ + 
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Through simple calculation, one can get that 

kik2k3 



kik2ks + k-ik-2k-s 
and 

p- = P^E^{T^ > T_) = 1 - P{E}{T+ < r_: k.^k.2k.s 



Consequently, 



P 



p 



hhh + ^-1^-2^-3 
Jf (T) 



JS_S _^ JSS ^J^^^ ' 



and 



j)+ JSS /rp_ \ 

A^i = kBT log 7 = fcijr log ^ = A:bT log = kgT log )— ^ , 

p- Ji' (r+) 

which follows p"*" = if and only if this system is at chemical equilibrium. 
3.3 Generalized Haldane equality 

To avoid the unnecessary difficult mathematical details, we apply a simple trick like the 
"time-reversal mapping" always used in modern statistical physics O [U [71 [22l [231 [Ml [IS] 
instead of the rigorous language of measure theory. 

We introduce a one-to-one mapping r for the trajectory of the simple kinetic in Fig. 
m which belongs to the event {T-|- < T_}, mapped to its "quasi-time-reversal" one. 

For each trajectory to = {tot ■ t > 0, luq = {E}} belonging to the set {T+ < T_}, let 
T* be the last time when it leaves the state {E} before finishing a forward cycle in the 
Figlljb). Then its "quasi-time-reversal" one ruj = {{rLo)t : t > 0} is defined as follows: 

i) when the time t is before or equal to T* , then one just copy iv to ruj, i.e.{ruj)t = uJt] 

ii) when the time t is between T* and T+, then one maps the real time-reversal trajectory 
of Lo with respect to the time interval [r*,T+] to ru), i.e. {ruj)t = ujt*+t+-u 

iii) when the time t is greater than r+, then one can also simply copy uj to as what we 
have done in (i). 

See Fig. [3] for an illustrative example. As having been pointed out on this figure, T* is 
denoted to be the last time when it leaves the state {i?} before finishing a forward cycle 
E ES EP E. Then the ratio of the probability density of the above trajectory 
with respect to its "quasi-time-reversal" one below is 

_ {kik-ik-^k^) X {kik2k-2k2k3) _ kik2ks 
^ (kik-ik-^ks) X {k-3k-2k2k-2k-i) fc_ife_2fc_3' 

Now it is indispensable to explain why we construct the above mapping like this. 
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E-ES-E-EP-E-ES-EP-ES-EP-E 

k-i k_-| k_g kg k-| k2 ^-2 ^2 ^3 
0— 1 - 0- 2 ^ 0-*1j-2 - 1 - 2 

V 

Time-reversal 




t 

T* 

V 



E-ES-E-EP-E-EP-ES-EP-ES-E 

k-| k_-| k_g kg ^-3 ^-2 ^2 ^-2 ^-1 

0- 1-0-2^0-2 - 1 - 2 ^ 1 ^0 



Figure 3: An illustrative example of the "quasi-time-reversal" map. T* is the last time 
when it leaves the state {E} before finishing a forward cycle E — > ES — > EP E, then 
one maps the real time-reversal trajectory of uj with respect to the time interval [T*,T4_] 
to ru!. See text for details. 

1) The number of the steps E — > ES in the original trajectory u belonging to 
{r^ < T_} is one more than that in its "quasi-time-reversal" corresponding trajectory 
rco belonging to {T-^ > T_}, while the number of the steps ES — > in u; is one less than 
that in ru; 

similarly, 

2) The number of the steps ES EP in the trajectory uj is one more than that in 
riv, while the number of the steps EP ES in the trajectory uj is one less than that in 
ruj; 

3) The number of the steps EP E m the trajectory uj is one more than that in ruj, 
while the number of the steps E EP in the trajectory uj is one less than that in ruj; 

and more important 

4) The dwell time upon each state of the trajectory uj and its "quasi-time-reversal" 
corresponding one ruj is mapped quite well such that the difference between uj and ruj are 
only exhibited upon their sequences of states. 

Consequently, the most important observation is that the ratio of the probability den- 
sity of each trajectory uj in {T+ < r_} with respect to its "quasi-time-reversal" trajec- 
tory ruj in {T-|_ > r_} is invariable, which is surprisingly always equal to the constant 

I k-ik-2k-3 ' 

Rigorous proof needs to be expressed in the language of measure theory, especially 
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applying the Radon-Nikodym derivative similar to [26i Lemma 2.2.7], so more details is 
omitted here. 

Furthermore, the map r is a one-to-one correspondence between the trajectory sets 
{T-|_ < T_} and {T+ > r_}. More particularly, for each t > 0, the map r is also actually a 
one-to-one correspondence between the trajectory sets {T+ = t < T_} and {T+ > T_ = t}. 

Therefore, for each t > 0, 

P{E}{T+ = t,T+< r_) = ^P{E}{T. = t,T^< r+), 

and 

p+ = P{E}{T+ < T_) = jP{E}{T- < T+) = 7p-, 

which has already been proved in the above section. 

Denote the conditional probability density of T-f given that {T-^ < T_} as = 
P{E}{P+ = < ^-)) and the conditional probability density of T_ given that {T_ < 
r+} as G_(t) = P{E}{T- = t|T_ < T+). Hence, 



P|^|(T+<T_) 

= ^^^^y-———^ = p^E}iT. = t\T.<T^) = e^it),yt. (11) 

And also denote the probability density of T as Q{t) = P^e}{P = t), so 

Q{t) = e+{t)p+ + e_(t)p- = e+(t) = e_(t). 

It consequently follows a very important corollary that the distribution of waiting cycle 
time T is independent of whether the enzyme E completes a forward cycle or a backward 
cycle, although the probability of these two cycles might be rather different, i.e. 

P{E}{T = t,T+< T_) = p(r+ = t,T+< r_) = e+{t)p+ = e(t)p+, 

and 

P{E}{T = t,T+> T_) = p(r_ = t,T+< r_) = e_(t)p- = e(t)p-. 

Furthermore, we have 

(r+,T+<T_)=p+(r), 
(r_,T_ <T+)=p-(r), 

and 

(r+|r+ < r_) = (r_|r_ < r+) = (t). 
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which means even in the far from equihbrium case (7 >> 1), the dweh times for each 
forward cycle or each backward cycle are identical although their frequencies may be 
rather different (p+ >> p^). 

At the end of this section, we will present an interesting corollary about the entropy 
production rate Cp. Due to the classical result of entropy production rate in a general 
mesoscopic model of biochemical kinetic diagrams [19\ [20l [26] , one has 

ep = (Jf - Jl^)log7, 

JSS 

where log 7 = log -ps is the entropy production rate of the cycle E ES —f EP E, 
and J!^ — Jt^ is its net cycle flux. 

Applying the above results to waiting cycle times, Cp can be expressed as 

= (^-^)log7 

= {p~^ — p^) avepr, (12) 

where avepr = log 7 = log jjs = log -jij^ = log ^ is regarded as the time- 
averaged entropy production rate of the cycle E ES EP — > E. 

Finally, it should be emphasized that this entropy production rate can also be measured 
by — i^-(t)) log ^"""l^l when the time t is large in the single-molecule experiment, 

recalling that J!^ and J^^ can be approximated by and z^-(t) respectively. 



4 Extending to the n-step cycle 

In the previous section, most of the results are obtained through solving a number of 
master equations similar to But now we claim that the same results can be extended 
to the n-step cycle [14J, according the elementary renewal theorem in probability theory 
[U Sec. 3.4, Theorem 4.1,4.2] and general circulation theory of Markov chains [26, Chapter 
1,2], which has already been derived for more than two decades. But the key method is 
also the same "quasi-time-reversal" mapping r introduced in the previous section. 

Below is the summation of the main results in the n-step cycle, which is quite similar 
to the 3-step cycle. 

4.1 Cycle flux and NESS 

We consider a n-step mechanism of the Michaelis-Menten kinetics in which the conversion 
of S into P in the catalytic site of the enzyme is represented as a process separate from 
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release of P from the enzyme. 



E + S ^ ES{= ESi) — ES2--- ^ EP{= ESn-i) ^ E + P, (13) 

k-l k^2 fc-(n-l) 

in which and are second-order rate constants. 

If there is only one enzyme molecule, then from the enzyme perspective, the kinetics 
are stochastic and cyclic, with the pseudo-first-order rate constants ki = k^cs and k-n = 
k^^cp where cs and cp are the sustained concentrations of substrate S and P in the 
steady state. 

This system is at chemical equilibrium if and only if 

kik2k^ ■■■kn 



k-ik-2k-s ■ ■ ■ k^n 

In the nonequilibrium case, 

kik2k-i ■■■kn 



1. (14) 



7/1, (15) 



fc_lfc_2^-3 ■ ■ - k-y 

and = kpT In j is well known as the cellular phosphorylation potential. 

Denote a n-dimensional matrix Q = {qij}nxn in which qi^i+i = ki, qi^i-i = A;„(j_i), 
i = 2, • • • ,n - 1, qi^2 = h,qn,i = kn,qi,n = k-n,qn,n-i = ^-(n-1)) and others are all zero. 
And let D{H) be the determinant of Q with rows and columns indexed by the index set 
H. 

Then according to [26l Theorem 2.1.2], the enzyme turnover rate of 5 — > P, which 
corresponds to the net flux of the n-step cycle, can be expressed as 

.ss _ fclfcs/cs ■■■kn- k_ik_2k-3 ■■■k-n _ jss _ jss /ifi^ 



where 



is the forward cycle flux, and 



kik2k3 ■■■k 

•J J- 



T.r=,,2,-,nD{Wy 

k^ik^2k-3 ■ ■ ■ k^n 



E^=l,2,■■■,nD{{^ry 

is the backward cycle flux. 

It should be noticed that the expression of J2i=i2 -- n-^({^}^) equivalent to the 
King-Altman method Chapter 4] but more general and applicable. Furthermore, it can 
be easy simulated by mathematical softwares, such as Matlab and Mathematica. 

The net cycle flux can also be expressed as the Michaelis-Menten steady-state flux of 
(HI, i.e. 

kscs - kpcp 
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where the definitions of kg, kp, K^Sj and K^p are much more comphcated than the 
3-step cycle. That is just Eq. (2.46) in [1]. 

Also similar to the 3-step cycle, and can be rigorously proved to be the averaged 
numbers of the forward and backward cycles per time respectively due to ergodic theory 
[261 Theorem 2.1.2], i.e. 

= lim -v{t), 

t^oo t 
t-^oo t 

= lim -v-{t), (17) 

t— >oo t 

where v+{t) and v-it) are the number of occurrences of forward and backward cycles up 
to time t, and fit) = f+i^) ~ ^-(*)- 

4.2 Mean waiting cycle times 

Starting from the free enzyme state three kinds of waiting cycle times can be defined. 
T represents the waiting time for the occurrence of a forward or a backward cycle, 
represents the waiting time for the occurrence of a forward cycle, and r_ represents the 
waiting time for the occurrence of a backward cycle respectively. 

According to the elementary renewal theorem |8i Sec. 3. 4, Theorem 4.1,4.2], 

1 1 



(T) = lim 



Similarly, 



t-oo (t) + V- (t) jf + .r' 

(T+) = lim ^ ^ 



and 



4.3 



(T) = lim = —. 



The stepping probabilities p'^{t) and p (t) up to time t are just the fractions of v+it) and 
i/_(t) from the statistical point of view in experiments, i.e. 



According to Eq(T71 one can get the eventual stepping probability 

+ def , , 

= hm p [t) 



ss 



= (18) 
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Moreover, due to Eq. p9|) in the next subsection, one can prove that the forward 
stepping probabiHty could also be defined as P^e}(T+ < T_), which means the 

probability that the particle first completes a forward cycle before a backward one, starting 



from the initial state {E}. Similarly, the backward stepping probability could be defined 
as p~ P^E}{T- < too. 
Consequently, 



P 



jfTJ^ ~ (r_)' 



and 



p+ JSS (rp^ ^ 

A/i = ksT log — = ksT log ^ = ksT log— —. 
4.4 Generalized Haldane equality 

Also introduce the same one-to-one mapping r for the trajectory of the n-step kinetic, 
which belongs to the event {T-f < T_}, mapped to its "quasi-time-reversal" one. 

And recall that the number of each forward step in the original trajectory uj belonging 
to {T-i- < T_} of the n-step model is one more than that in its corresponding quasi-time- 
reversal trajectory rco belonging to {T^ > T_}, and on the contrary the number of each 
backward steps in lo is one less than that in ruj. And the dwell times of the trajectory uj 
and ruj are mapped quite well such that the difference between lo and ruj are only exhibited 
in their sequences of states. 

Consequently, the most important observation is that the ratio of the probability 
density of every trajectory uj in {T+ < T_} with respect to its "quasi-time-reversal" 
ru! in {r+ > T_} is invariable, which is surprisingly always equal to the constant 7 = 

klk2k3---kn 



k-\k-2k-i---k-n ' 

On the other hand, the trajectory map r is a one-to-one correspondence between the 
trajectory sets {T+ < T_} and {T+ > T_}. More particularly, for each t > 0, the map r 
is also actually a one-to-one correspondence between the trajectory sets {r+ = t < T_} 
and {r+ > T_ = t}. 

Therefore, 

p+=7p-, (19) 

which can be used to prove the results of stepping probabilities in the previous subsection. 
Then one arrives at the generalized Haldane equality in the version of distribution, 

e+(t) = P{E}{T+ = t\T+ < T_) = PiE}{T- = t\T_ < r+) = e_(t), vt > o, (20) 
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and 

Q{t) = P{E}{T = t) = Q+{t)p+ + e-(t)p- = e+(t) = Q-{t). 

It consequently follows a very important corollary that the distribution of waiting cycle 
time T is independent of whether the enzyme E completes a forward cycle or a backward 
cycle, although the probability of these two cycles might be rather different. 

Hence, the generalized Haldane equality in the version of conditional expectation is 

(r+|r+ < r_) = (r_|r_ < t+) = (r), (21) 

which means even in the far from equilibrium case, the dwell times for each forward cycle 
or each backward cycle are the same although their frequencies might be rather different. 

Similar to the previous section, we present an interesting corollary about the entropy 
production rate = {J^ — Jt^) log 7, and 

= {p^ — p^)avepr, (22) 

where avepr = log 7 = log 7^ = log |^ = log ^ is regarded as the time- 
averaged entropy production rate of the n-step cycle. 

Furthermore, the following statements are equivalent to each other: 

1) This n-step system is at chemical equilibrium; 

2) The cellular phosphorylation potential A^u vanishes, i.e. 7=1; 

3) The forward and backward cycle fluxes are identical, i.e. = J'l^; 

4) The waiting times for the forward and backward cycles are identical, i.e. (T-f ) = (T_); 

5) The eventual stepping probabilities of the forward and backward cycles = P - 

Finally, it should be emphasized that this entropy production rate Cp can also be mea- 
sured by — log when the time t is large in the single-molecule experiment, 
recalling that Jif and can be approximated by and z^-(t) respectively. 



5 Experimental and theoretically based evidence 

Several results proved in the present article have been observed and discovered in the 
recently reported single-molecule experiment [3] of kinesin, which is one of the most im- 
portant molecular motor proteins. The time trajectories of single kinesin molecules have 
been measured for different external forces and for different ATP concentrations, recording 
the number of forward and backward steps i^-(t)) and the stepping probabilities 
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(p~^{t), p^{t)), which are also cahed fractions of forward and backward steps. They in- 
vestigated the physical mechanism of the kinesin step, and found that both forward and 
backward 8-nm steps occur on the microsecond timescale without mechanical substeps on 
this timescale. It was also shown in [3l Fig. 3] that the time constants for the ensemble 
averaged forward and backward steps are very similar, which is just the generalized Hal- 
dane equality ) in the version of conditional expectation ()2ip . However, this interesting 
observation has not attached enough importance to in [3] and there was no theoretical 
analysis either. 

Almost published at the same time, Kolomeisky, A.B., et.al. [27J put forward a discrete- 
time biased random walk model, applying the first-passage-time method [35j and splitting 
probability theory |47J Chap. XI], to provide explicit expressions for the fractions of 
forward and backward steps and dissociations, and most important to conclude that the 
mean dwell times to move forward, backward, or irreversible detach are equal to each 
other, independent of ATP concentrations or external forces \27\ Fig. 3]. The concept of 
splitting probability in |27l B7] is the same as the stepping probability in the present 
paper, and the mean dwell times to move forward and backward in the discrete-time 
model in [27] just correspond to the conditional expectation of r+ given < T_ and the 
conditional expectation of T_ given T_ < T-f in the continuous-time model of the present 
paper respectively, hence what they conclude was also the generalized Haldane equality 
in the version of conditional expectation (Eq. [2T]) . They also claimed that these forward 
and backward dwell times should be independent of what direction the motor protein will 
go in the next step, although the probability of these steps might be rather different. 
Nonetheless, they didn't notice the forward and backward fluxes and generalized Haldane 
equality in the version of distribution (I20p . and it is a pity that they didn't consider the 
continuous-time case which is actually more difficult to prove. 

Meanwhile, Kou, S.C., et.al. [28j summarized their theoretical understanding of single- 
molecule kinetics, and focused on the conditions under which a single- molecule Michaelis- 
Menten equation for the reciprocal of the mean stochastic waiting time T for individual 
turnovers ( [28i Eqs. 14, 26, 30, 34]). As have been mentioned in the introduction, the 
model they built is the simplest irreversible Michaelis-Menten mechanism, and the state 
space of their stochastic model (Markov chain) actually only contains two states {E and 
ES), which is just why they can just only use the ordinary differential equations to get 
the explicit distribution function f{t) of the waiting time or its approximations, avoiding 
to apply the strong Markov property which is the basic method to compute the first- 
passage-time problems in stochastic processes. So their method can not be generalized to 
more complicated cases, and generally speaking, the explicit distribution function f{t) can 
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rarely be analytically obtained in a multi-state and reversible stochastic model. Moreover, 
they didn't notice the existence of forward and backward waiting cycle times T+ and T_, 
and neglected many insightful observations. 

Afterwards, Qian, H. and Xie, X.S.[3l] studied a semi-Markov model of single-enzyme 
turnover in nonequilibrium steady states with sustained concentrations of substrates and 
products, since in some sense, the general result for any enzyme kinetics, if there is only one 
free enzyme state, can be mapped to a semi-Markov process. Then they gave a brief proof 
to the generalized Haldane equality in the version of distribution (j20p and also expressed 
the nonzero chemical driving force A/i as ksTlogjj^^. Hence a part of our results in 
the present paper are a special case of the rigorous semi-Markov result that Wang and 
Qian have obtained by the elementary renewal theorem |401l48j. But they didn't explicitly 
distinguish the conditional waiting cycle time and the absolute waiting cycle times, and 
may cause some ambiguities. 

At the end of this section, what should be paid more attention to in the statistical 
data analysis of the experiment is to distinguish the data of waiting cycle times T, and 
r_. In reference [3], they only recorded the data of T, and divided them into two classes 
according to whether have completed a forward or a backward cycle. Consequently they 
found the mean values of the data in these two classes are very similar. But they haven't 
realized that the distributions of the data in these two classes should also be very similar, 
and actually they are data of the conditional waiting cycle times rather than the absolute 
waiting cycle times T, and T_, because in the nonequilibrium case, (T4.) and (T_) can 
not be identical and must be both greater than (T). See Fig. [4] for an illustrative example. 

According to the strong Markov property in the theory of stochastic processes, we 
claim that the data of and T_ can also be obtained from the same trajectory recorded 
in the experiment (Fig. H]), and we believe that other main results in the present paper can 
also be discovered from the same experiment data, especially the relationships between 
the cycle fluxes, mean cycle times and eventual stepping probabilities, which we have 
summarized in the previous section. 

6 Discussion 

Deterministic, nonlinear mathematical models usually based on the law of mass action have 
been traditionally used for modelling biological systems [10^ [33] , while nowadays stochastic 
fluctuations observed in most living organisms, such as evidences in the single-molecule 
approach |29[ [2] , have changed the way biophysical or biochemical problems are presented 
and have been recognized as a major important effect in cell biology. Stochastic models in 
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Figure 4: The solid line illustrates ideal data on single-enzyme cycling as a function of 
time, with distinguished data analysis of T, T+ and T_. See text for details. 

biochemistry have already provided important insights and quantitative characterizations 
of a wide range of biochemical systems [T9l[3nl [47l [TT| [T2| [39l [52] . 

In single-molecule experiments, the microscopic motion of an enzyme molecule under- 
goes rapid thermal fluctuation due to its incessant collisions with the solvent molecules, 
and therefore, the data obtained are inevitably stochastic [49j. The main results in the 
present paper are actually based on one type of measurements in single-molecule enzymol- 
ogy, which records the stochastic conformational dynamics of an enzyme turnover (called 
^trajectories") [29]. From the perspective of the stochastic process, a trajectory is a sta- 
tionary stochastic process which can be analyzed by statistical methods. And moreover 
based on the ergidic theory, which is an elementary law in both statistical physics and 
the mathematical theory of stochastic process, an arbitrary single trajectory surprisingly 
contains all the information of the stochastic system. 

However, it is often thought that the noise added to the biological models only provides 
moderate refinements to the behaviors otherwise predicted by the classical deterministic 
system description, while in the present paper, it is quite obvious that the main problems 
discussed here are impossible even to be put forward in a deterministic model. So it may 
be necessary to reconstruct the main biological theory based on the stochastic models in 
order to explain the experiment results of single molecule tracking. 

For instance, applying the statistical methods to analysis the recorded single trajec- 
tory, experimentalists can not only directly measure the distribution of the waiting cycle 
(turnover) times T, T+ and T_, but also the probability cycle fluxes J^^, and 
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widely used in this paperwhich can be approximated by the time-averaged number of oc- 
currences of the cycle, according to the general ergodic theory of cycle fluxes [261 Theorem 
2.1.2] and elementary renewal theorem |8[ Sec. 3.4, Theorem 4.1,4.2]. Furthermore, it is 
important to notice that the distributions of single-molecule properties and the probabil- 
ity cycle fluxes can only be presented and measured through single-molecule experiments 
rather than the ensemble average. 

In conclusion, the single-molecule enzymology is still in its early ages, and in my 
personal opinion, the generalized Haldane equality as well as the relationship between mean 
waiting cycle times and cycle fluxes may be the first interesting discovery in this active field, 
which could be applied to instruct the data analysis of single-molecule trajectories. In the 
future, we believe that more and more important phenomenon and theory in traditional 
enzymology, such as inhibition and activation, cooperativity and multi-enzyme systems, 
will enter into the single-molecule enzymology, which might stimulate very significant 
developments both in experiment and theory. 

In addition, from the purely mathematical point of view, we also believe that it is 
valuable to further extend similar results to the theory of much more general Markov 
chains, though much technical work remains to be done. 
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